CLAIMS: 



What is claimed is: 



, . A computer implemented me/hod of adding each of a first plurality of 
elements of a firs, packed data together to/roduce a firs, result comprising the steps of: 
producing a first plurality of port/ons of a plurality of partial products ustng a 
. first plurality of partial product selecto/s in a multiplier, each of the firs, plurality of 
I portions of the plurality of partial Deducts being zero; and 

5 inserting each of the first oLlity of elements Into one of a second plurality of 

, portions of the plurality of parti/products using a second portion of a plurahty of 
s partial product selectors, each/f the second plurahty of portions of the plurality of 
9 partial products being aligaek: and 



10 adding each/f the /irst 

1 1 including a field liaving 



>ui 



lurality of elements together to produce the first result 
itj. of the first plurality of elements. 



step of shifi 
the least sigl 



The coftir/utet implemented method of Claim 1 further comprising the 
the fi/XsA to produce a second result having the field aligned with 
lificantJBit of the \econd result. 



ng 



2 
3 



3. Ttfe compmer implemented method of Claim 1 wherein the first plurality 
of elements eaclf consist of eight elements, each of the firs, plurality of element betng 
an unsigned by 



4. 



The computer implemented method of Claim 1 wherein the multiplier 



2 has zero as one operand. 
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5. The computer implemented ijfethod of Claim 1 further comprising the 

steps of: 

■producing a fourth packed data/fiaving a fourth plurality of elements and a 
plurality of sign bits, each of the fourth plurality of elements and the plurality of sign 
bits being computed by subtracting one of a second plurality of elements of a second 
packed data from a corresponding one of a third plurality of elements of a third packed 
data; and 

producing the first packed data, each of the first plurality of elements being 
computed by subtracting one of the fourth plurality of elements from the corresponding 

10 one of an at least one element, if the corresponding one of the plurality of sign bits is in 

1 1 a first state; and addmg orf of tfye fourth plurality of elements from the corresponding 
one of the at lepst ofic elqjhnent Jf the corresponding one of the plurality of sign bits is in 



12 
13 



10 



a second stat 



h^/compper implemented method of Claim 1 further comprising the 

steps of/ 

eing a fourth packed data having a fourth plurality of elements, each of the 
fourth plurality of elements being maximum value of one of a second plurality of 
elements/ of a second packed data and one of a third plurality of elements of a third 
packed/data; 

producing a fifth packed data having a fifth plurality of elements, each of the 

8 fifth Plurality of elements being minimum value of one of the second plurality of 

9 elements and one of the third plurality of elements; and 
producing the first packed data, each of the first plurality of elements being one 



1 1 off the fifth plurality of elements minus corresponding one of the fourth plurality of 

12 elements. 
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7. The computer implemented metlySd of Claim I further comprising the 
steps of: 

producing a fourth packed data having a fourth plurality of elements, each of the 
fourth plurality of elements being one of a/second plurality of elements of a second 
packed data minus the corresponding oy of a third plurality of elements of a third 
packed data saturated at zero; 

producing a fifth packed datif having a fifth plurality of elements, each of the 
fifth plurality of elements being ofte of the third plurality of elements minus the 
corresponding one of the second plurality of elements saturated at zero; 

producing the first packed data, each of the first plurality of elements being a 



bitwise logical OR of oife of the 
of the fifth plurality Df element 



urth plurality of elements and the corresponding one 



8. / An app 
having a fi/st plurality 

a multiplier 
partial products us 




r performing a horizontal add of a first packed data 
ents comprising: 

produce a first plurality of portions of a plurality of 
ality of partial product selectors, each of the first 



plurality of portions of the plurality of partial products being zero and inserting each of 
the first plurality/of elements into one of a second plurality of portions of the plurality 
of partial products using a second plurality of partial product selectors, each of the 
second plurality of portions of the plurality of partial products being aligned; and 

an adder tree configured to receive the plurality of partial products and produce a 



first result inc 
9. 

selectors are 



uding a field having the sum of the first plurality of elements. 

The apparatus of Claim 8 wherein the first plurality of partial product 
oupled to receive a portion of an operand, the operand being zero. 
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10. The apparatus of Claim 9 wKerein the operand has a single element. 

1 1 . The apparatus of Claim 9 ^herein the operand is a packed data. 

1 2. The apparatus of Claim p wherein the adder tree comprises: 
a plurality of carry save adders operable to receive the plurality of partial 

products and produce a set of carry and sum signals; and 

a carry lookahead adder operable to receive the set of carry and sum signals and 
generate the first result, the first result being the sum of the set of carry and sum signals. 



13. The apparatusMClaim 8 further comprising a shifter coupled to receive 
the first result and shift ^first result to {jroduce a second result having the field 
aligned with the least/ignificAnt bifyof the second result. 



14. The apparatus of Clkm 8 further comprising: 

a first cii/cuit configured to receive a second packed data having a second 
plurality of elenWs andjKhird packdd data having a third plurality of elements to 
produce a fourth packed data having a fourth plurality of elements, each of the fourth 
plurality of elements beifag maximum value of one of the second plurality of elements 
and one of the third plurality of elements; 

a second circuit configured to receive the second packed data and the third 
packed data to produofe a fifth packed data having a fifth plurality of elements, each of 
the fifth plurality of dements being minimum value of one of the second plurality of 
elements and one oy the third plurality of elements; and 

a third circuit configured to receive the fourth packed data and the fifth packed 
data to produce thjb first packed data, each of the first plurality of elements being one of 
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* 



13 the fifth plurality of elements minus ctprresponding one of the fourth plurality of 

14 elements. 



m 
Si 



15. The apparatus of Cla&m 8 further comprising: 
a first circuit configured toireceive a second packed data having a second 
plurality of elements and a third packed data having a third plurality of elements to 
produce a fourth packed data having a fourth plurality of elements, each of the fourth 
plurality of elements being one/ofjthe second plurality of elements minus the 
corresponding one of the tjjifd plurality of elements saturated at zero; 

'eceive the second packed data and the third 
data having a fifth plurality of elements, each of 
Dne of the third plurality of elements minus the 
plurality of elements saturated at zero; and 
to receive the fourth packed data and the fifth packed 
data to produce the first parked data, each of the first plurality of elements being a 

13 bitwise logical OR of one/of the fourth plurality of elements and the corresponding one 

14 of the fifth plurality of elements. 



3 
4 
5 
6 

7 a second circuit/onfigtred 

8 packed data to prod 

9 the fifth plurality o^ element 

10 corresponding one 

1 1 a third circuit 
12 



/pac 
' bei/g 
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